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Abstract 

Single molecule force spectroscopy methods can be used to generate folding trajectories of biopoly- 
mcrs from arbitrary regions of the folding landscape. We illustrate the complexity of the folding 
kinetics and generic aspects of the collapse of RNA and proteins upon force quench, using simulations 
of an RNA hairpin and theory based on the de Gennes model for homopolymer collapse. The folding 
time, TF, depends asymmetrically on 6fs = fs - fm and Sfg = fm - fg where fs (/q) is the stretch 
(quench) force, and fm is the transition mid-force of the RNA hairpin. In accord with experiments, 
the relaxation kinetics of the molecular extension, R{t), occurs in three stages: a rapid initial decrease 
in the extension is followed by a plateau, and finally an abrupt reduction in R{t) that occurs as the 
native state is approached. The duration of the plateau increases as X = tq/tf decreases (where tq is 
the time in which the force is reduced from fs to fg). Variations in the mechanisms of force quench 
relaxation as A is altered are reflected in the experimentally measurable time-dependent entropy, which 
is computed directly from the folding trajectories. An analytical solution of the de Gennes model 
under tension reproduces the multistage stage kinetics in R{t). The prediction that the initial stages of 
collapse should also be a generic feature of polymers is validated by simulation of the kinetics of toroid 
(globule) formation in semifiexible (flexible) homopolymers in poor solvents upon quenching the force 
from a fully stretched state. Our findings give a unified explanation for multiple disparate experimental 
observations of protein folding. 
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The folding of RNA molecules [T] and proteins [21 El HI E] should be thought of as dy- 
namic changes in the distribution of conformations that result in collapse, the formation of 
intermediates, and barrier crossings to reach the folded structures. Such a statistical mechan- 
ical perspective of the folding process is finding support in single molecule measurements that 
manipulate the structural ensembles using an external mechanical force (/) [SI El IE]- The de- 
velopment of the force-clamp methods [SI E] , which allow the application of a constant force to 
specific locations on a biomolecule, have made it possible to explore in the most straightforward 
manner the folding of proteins and RNA initiated from regions of the folding landscape that 
are inaccessible in conventional ensemble experiments. The use of an initial stretching force, fs, 
which fully unfolds the biomolecule, followed by a subsequent quench to a sufficiently low force, 
/q, which populates the Native Basin of Attraction (NBA), holds the promise of unearthing all 
aspects of the folding reaction, including the dynamics of the collapse process and the extent to 
which the folding pathways are heterogeneous. 

Although there is great heterogeneity in the folding trajectories, Fernandez and Li [6] noted 
that upon the quench fs ^ Iq, ubiquitin (Ub) folded in three stages as reflected in the time- 
dependent changes in the extension, R{t). Typically, after a very rapid decrease in R{t), there 
is a long plateau in R{t) that is suggestive of the formation of metastable states, which we 
term Force- Induced Metastable Intermediates (FIMI's). Finally, R{t) decreases sharply in the 
last stage as the extension corresponding to the native state is reached. Although it is not 
emphasized, a similar behavior has been observed in the folding of TAR RNA (see Fig.3A in 
|8]) upon abrupt decrease in /, even though the forces used in Atomic Force Microscopy (AFM) 
experiments on Ub and Laser Optical Tweezer (LOT) experiments on the RNA differ greatly. 
In contrast, it has been argued that Ub folds in a discrete manner based on refolding initiated 
by decreasing force slowly using AFM [7J, and not in the way described in [6J. The seemingly 
contradictory conclusions on the refolding of an initially extended Ub require a full theoretical 
description of the molecular events that ensue when force is decreased from fs- 

In order to quantitatively describe the results of single molecule measurements, the interplay 
between a number of factors have to be taken into account. The variables that can be controlled 
in experiments are fs, fq, and Vl, the speed with which the position of transducer is contracted 
(over a length AL) or equivalently, Tc, the rate of force contraction. Because the routes navigated 
by the biomolecule will depend on the precise protocol used [H |9], these factors must be taken 
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into account when comparing different experiments, even if the biomolecule under consideration 
is the same. For a system whose global folding obeys two-state kinetics, the two relevant time 
scales are the force relaxation time, tq (with tq ^ AL/vl or equivalently, tq ^ {fs — fQ)/rc), 
and the overall folding time, Tp{fs, fq). The dynamics of the folding trajectories, R{t), should 
depend on A = tq/tp. If A <^ 1 then folding takes place far from equilibrium, whereas in the 
opposite limit refolding occurs under near-equilibrium conditions. Thus, by altering the force- 
quench protocol it is possible to control the collapse process and the folding routes for a given 
protein or RNA, even if fq and fs are fixed. 

Here, we explore quantitatively the scenarios that emerge for the folding of a RNA hairpin 
as a function of fq and fs- The value of A is controlled by altering Tp{fq,fs) in this work, 
rather than by adjusting the force contraction timescale tq, as in Refs [HI E]. We place 
particular emphasis on the general features of the physics underlying the early stages of 
length contraction. The folding trajectories for a RNA hairpin exhibit a long plateau of 
varying length in -R(t), associated with the development of a FIMI. More generally, we show 
that the folding trajectories can be controlled by the choice of an appropriate force-quench 
protocol, which is succinctly expressed in terms of A. The origin of the long-lived FIMIs when 
A is small is explained by adapting de Gennes' picture of the kinetics of a coil to globule 
transition in flexible homopolymers. The combination of simulations and theory shows that 
the formation of long-lived FIMIs should be a generic feature of any biomolecule that adopts a 
compact structure, which implies that the polymeric nature of RNA and proteins, along with 
interactions that induce a globular state for sufficiently low /g, will generically lead to a long 
plateau in R{t) as long as A is small. These conclusions are supported by explicit simulations 
of force-quench studies of a flexible homopolymer in a poor solvent and a semiflexible chain 
model for DNA, in which the compact state adopts a toroidal structure. Our theory, that 
emphasizes A as the relevant variable in determining the folding routes, naturally explains the 
apparent differences in the interpretation of force-quench folding of Ub by different groups [6l [7] . 

RESULTS 

Asymmetry in the force-quench kinetics : Force-quench refolding of a RNA hairpin 
(or a protein) can be initiated using two distinct modes. In the flrst, the RNA is equilibrated at 
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FIG. 1: Force-quench kinetics of P5GA hairpins. A. Refolding from a fixed fs to varying fq (left) or 
from varying fs to a fixed /g (right). B. Tp versus fg starting from different fs- A. Fit to the Bell 
equation gives Ax[^^^ (0.6 — 0.7) nm. The extrapolated value Tp to zero quench force is plotted 
in C (* symbols). C. Tp versus fs for each value of fg. The variations of Tp with fs are fit to Eqjl} 
From the fit, we find {Axun /nm,T{fQ) / fis) =(1.5, 89.7), (2.5, 126), (1.8, 150), (2.6, 224), (3.8, 357), 
(4.4, 622) for fg =0, 2, 3.5, 7, 10, 12 pN, respectively. D. Time traces of the molecular extension, R{t), 
upon force quench. Multiple time traces are plotted in grey, and the averaged time trace is shown in 
thick black. A few representative time traces are in color. The ensemble of time traces are shown for 
fs = 70 pN and fq = 3.5 pN. 

an initial fs>fm, where fm is the transition force at which the probabilities of being folded and 
unfolded are equal, and subsequently the force is quenched to various fq values {fs {fg}), 
with fq < fm (Fig.l). We use simulations of the P5GA hairpin [10], with fm = 14.7 pN at 
T = 300 K (see Methods) to extract the dependence of Tp on fq from the dynamics of folding 
trajectories (Fig.lD and Fig. 2). Similarly, refolding can also be initiated by varying fs{> fm) 
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and reducing the force to a single /q(< /„) {{fs} /q) (see Fig.lA). 

The mean refolding time {rp = f*^^ different values of fg is fit to a Bell equation 

igillllia (Fig.lB), TpifQ, fs) = r°{fs)efQ^^l-N/kBT ^^^eie T°p{fs) is the folding time at /q = 0, 
and Axfj^p is the distance between the transition state and the free energy minimum in the 
unfolded basin of attraction (UBA), which assumes that i? is a good reaction coordinate. The 
slight upward curvature in log{Tp) as a function of fq is due to the dependence of Axfj^jy on 
/q [iHl [in [E]. When tf is rescaled by Tp{fs), the refolding times at different fs values nearly 
collapse onto a single curve (see the panel on right hand side in the Fig.lB). It follows that 
the time scale for crossing an effective free energy barrier (in the late stages of folding) is solely 
determined by /g, and is independent of fs- The microscopic model in [151 IIS] provides an 
excellent fit to the simulation data (see also Fig.S2 and the caption therein). Here, we use the 
Bell equation to provide a physically simpler picture of the effect of force on the free energy 
profile. 

Unlike the behavior of rp with varying fq, the dependence of Tp on fs cannot be explained 
using the Bell model. We find that Tp decreases sharply with fs for fs < 20 pN, but saturates 
to a limiting value (which is also dependent upon fq) if fs > 20 pN (Fig.lC). The saturation of 
Tp for large fs suggests that the initial conformations of P5GA belong to the same structural 
ensemble (see the variations of the end-to-end distribution, Ps{R), at each fs in Fig. SI A). 
At low fs {fm ^ /5 < 20 pN) the molecules are partitioned into different ensembles. The 
initial population of molecules in the UBA, determined by fs, affects the average folding time. 
Since P5GA is two-state folder under force, the fraction of molecules in the UBA at fs is 
^UBAifs) = !/[! + e^^^vN-fs^^uN)/kBT^^^ where AF°^ = F° - and Axun = xu-xn are the 
difference in free energy and molecular extension between unfolded and folded conformations, 
respectively. By combining the /g-dependence of Tp from Bell's model with (puBAifs), we obtain 
a unified expression that describes force-quench folding time in both modes (see SI for details), 

rF{ojs,ojq) ^ ^g_5/gAxcjvAflT^ 

where 6fs = fs - fm, Sfq = fm - fQ, riO) = Toe^^i^N/^BT hemg the time scale in the 

absence of activation barrier, and we have used AF{}^ — fmAxuN = 0. The fits using the two 
parameters Tp{fq, fs){= T{0)e~^^'^^^u^N/f'BT^ ^-^^ Axun, provides an excellent description of 
the simulation data (Fig.lC). The analytic result in Eq{l] also shows the inherent asymmetry 
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in the two modes for initiating folding by force-quench. 
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FIG. 2: Analysis of the folding trajectories associated with force-quench for various values of {fs, fq)- 
A : /5 = 56 pN ^ /q = 12 pN, B : = 14 pN ^ /q = 12 pN, C : = 56 pN ^ /q = 2 pN, D : 
/s = 14 pN ^ /q = 2 pN. E : /5 = 56 pN ^ /q = pN. Pere{R) and Ptse{R) are shown in the 
right-hand side panels of A-E, and the corresponding structures are shown for the ERE (left) and TSE 
(right). Thermally denatured ensemble Ptde{R) for T ^ Tf is shown in A. 



Non-equilibrium force-quench folding is reflected in R{t): The dramatic variations 
in the refolding time TpifQ, fs) (Figs. IB and IC) are reflected in the folding trajectories, R{t), 
as fs and fq are varied. In Fig. ID (see also Fig. 2), which shows the ensemble of force-quench 
trajectories with fs = 70 pN and fq = 3.5 pN, we observe a rapid decrease in R{t) in the early 
stages of folding (stage 1), followed by a slowly decaying plateau of varying length (stage 2). 
The second stage is followed by an abrupt reduction to i? < 2 nm, corresponding to the hairpin 
formation [T7] [HI [131 HE] (stage 3). The three stages of folding are well separated temporally, and 
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the plateau length increases with increasing fg. Although multistage folding has been discussed 
in the context of T-jump protein folding O [191 EO], the dramatic manifestation of timescale 
separation at an "early" stage of protein folding is unique to force quench experiments. The 
nature of FIMI is different from that of a thermally collapsed intermediate state in that the 
FIMI state contains no intramolecular contacts (see below). The rapid increase in the folding 
times (Figs. IB and ID) is directly linked to the long-lived FIMIs. The plateau persists until the 
system reaches the transition state, R ~ Rts = 4nm, where Rts is the position of transition 
barrier at f = fm (Fig.S3). 

The process by which RNA navigates the folding landscape in stage 2 of the collapse process, 
during which R{t) is nearly a constant (Fig. ID), is assessed by monitoring the conformational 
changes in the hairpin. In order to illustrate these structural changes, we computed the dis- 
tributions, Pere{R) (ere stands for entropically relaxed ensemble) "immediately after" the 
force-quench (between stages 1 and 2), and Ptse{R) (TSE denotes the transition state en- 
semble) that gives the distribution function "immediately before" the final decrease of R{t) 
(between stages 2 and 3; see the SI for further discussion). The differences between Pere{R) 
and Ptse{R) are illustrated in Fig. 2. As long as fs ^ fm, Pere{R) consists of a set of homoge- 
neously stretched structures, whereas the structural ensemble of hairpin loops with disordered 
ends become dominant for fs < fm (see Fig.2A-2D). For fs = 56 pN, Ptse{R) is peaked at 
smaller values of i? ~ (6 — 7) nm, and becomes broader with decreasing /q (see Fig.2A, 2C, 
and 2E). The ensemble of structures that are sampled in this second stage are the FIMIs, which 
are analogues of minimum energy compact structures, postulated by Camacho and Thirumalai 
pi] for folding initiated by temperature quench. Comparison of P{R) in Figs 2A, 2C and 2E 
with Ptde{R) (TDE denotes the thermally denatured ensemble; see the pink distribution in the 
Fig. 2A), show that the conformations in the FIMI are unlikely to be sampled during a typical 
refolding initiated by temperature quench [TTj. 

At fs = 14 pN < fm (Fig.2B and 2D), Pebe{R) differs substantially from the distributions for 
fs = 56 pN ^ fm (Fig.2A, 2C, and 2E). The ERE has an approximately bimodal distribution, 
similar to the initial distribution Ps{R) f = fm [101122] (compare Pere{R) in Fig.2B, 2D with 
Fig. SI at fs = 14 pN), and is far more heterogeneous than the ensemble at fs = 56 pN. With 
fs^fm, the folding is initiated predominantly from molecules with a partially formed hairpin 
loop. In this case, the molecule refolds directly to the native state without a well-defined plateau 
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in R(t). A plateau in R{t) will be manifested most clearly only when the force is quenched from 
a large /^(^ fm) and fm)- The simulations also suggest that the lifetimes of the FIMIs 

depend on the protocol used to quench the force, which is determined by A (see Table I). 

The evolution from the ERE to the TSE, which occurs over a time scale determined by the 
length of the plateau, is associated with a search process leading to a nucleation step in the 
hairpin formation. We have shown previously, in a force quench simulations with /g = 0, that 
the dynamics of the plateau is associated with a search for the native dihedral angles (trans— 
gauche transition) in the tetra-loop region [T7] . However, the plateau formation in R{t) is 
a generic feature of force-quench dynamics, and has also been observed in recent experiments 
on poly-ubiquitin, 127, and the PEVK protein from titin [6l [23]. Interestingly, the formation 
of long-lived FIMIs also occurs in force-quench dynamics of semiflexible chains and flexible 
homopolymers (see Discussion). 

Link between entropy production and the collapse dynamics : The transition from 
the stretched state to the folded state that occurs through FIMIs can be described using time 
dependent changes in the entropy. From the ensemble of force-quench time traces, R{t), we 
estimate the entropy of the molecule as a function of time by using 

POO 

S{t)/kB = - dR P{R,t) log P{R,t), (2) 
Jo 

where P{R, t) is the end-to-end distribution function obtained directly from the ensemble of 
trajectories. At long times P{R, t oo) will reduce to the distribution for a given system. 
The time-dependent entropy is computed directly from the data generated from experiments 
or simulations, giving this approach a practical use, and thus captures the non-equilibrium 
behavior in all force regimes and over all time scales. Note that Eq. [2] makes no assumptions 
about a particular model for P{R,t) but is strictly valid only if R can adequately describe 
the conformation of the molecule of interest. Fig. 3 shows S{t) computed for four different 
pairs of (/s, /q). Interestingly, changes in S{t) are nonmonotonic when the system begins 
from a high stretch force {fs = 56 pN, Fig.3A), whereas the S{t) decreases monotonically 
upon force-quench from fs = 14 pN (Fig.3B). The quantitative analysis using S{t) clarifies the 
dynamics in the ensemble of time traces (Fig. 2). The evolution of the end-to-end distribution, 
Ps{R) Pere{R) — Ptse{.R) Pnba{.R), is due to changes in the dynamic evolution of the 
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FIG. 3: The time evolution of the entropy, measured using Eq. [2} A. S{t)/kB from fs = 56 pN to 
/q = 12 pN (red) and 2 pN (blue). B. S{t)/kB from fs = 14 pN to fq = 12 pN (red) and 2 pN 
(blue). C. The time evolution of S{t)/kB for varying fs with fixed fq = 12pN. In this panel, we take 
the running-average of S{t)/kB every 4.7 /is. 

entropy (Eq. |2| which progresses from a low (stretched) — s>medium (ERE) — s>high (FIMIs, TSE) 
— i>low (NBA) value as the hairpin folds, especially when the initial ensemble of states is fully 
stretched {R ^ 4nm in Fig.S2). At fs = 14 pN, Ps{R) is bimodal and S{t) with a maximum 
value at t = 0, and decreases monotonically as S{t)/kB ~ $e^*/^-f + (1 — $)e^*/^^ where $, tj 
and Ts are /g-dependent for the same value of fs- The time constants tj and Tg correspond to 
collapse of fast- and slow-track molecules in the multidimensional free energy landscape (with 
Tf < Tg), and $ is the fraction of fast collapsing molecules [1]. The fast and slow tracks are 
associated with the dynamics of collapse, starting from within the NBA {R < 4nm) and UBA 
{R > 4nm), respectively. We find $ = 0.29, Tf = 33.8 /is, = 715 jis for fq = 12 pN, and 
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$ = 0.47, Tf = 81.3 fis, Ts = 181 for /q = 2 pN. A smaller value of fg increases the fraction 
of conformations in the folding route starting from NBA to the native state. The force-quench 
dynamics from the force-denatured ensemble (FDE) with fs = 14 pN is an example of the 
kinetic partitioning mechanism [Tj, and shows that the mechanisms of hairpin formation can be 
surprisingly complex [S]. 

The transition from the monotonic to non-monotonic decay of S{t) occurs at fs ~ 16 pN 
(Fig.3C), and the time scales for S(t) to attain its maximum value are similar for = 18 — 56 
pN. However, the entropies of the initial structural ensemble, >S'(0), are different and decrease 
as fs increases. This finding provides insights into why rp saturates for fs > 18 pN in Fig.lC, 
since the P5GA hairpin is predominantly in the UBA for fs > fm + Sfi/2 = 14.7 + 1.2 pN ^ 16 
pN, where Sfi/2 is the half-width of force fluctuations around (see SI for details). 



The expanding sausage model under tension captures the physics of force-quench 
refolding of biopolymers : Although the non-equilibrium behavior of the hairpin, reflected 
in R{t) and the time scales for hairpin formation is shown for P5GA (Fig.l and 2), the dynamics 
of contraction of the molecule upon rapid quench should be a generic phenomenon, provided 
A is small and a driving force leads to the development of a compact structure for sufficiently 
low fq. In order to illustrate the generality of the FIMI that gives rises to the second stage of 
contraction (Fig.l, 2 and ref. [6]), we provide a theoretical description by adapting de Gennes' 
model [21] for collapse kinetics of a flexible coil. Using an "expanding sausage model," de Gennes 
envisioned that the collapse (upon temperature quench to T = Te — AT < Tq, the Flory 6- 
temperature) progresses from a randomly distributed string of thermal blobs of size ^(= ag^^'^, 
where g[= (Te/AT)^] is the number of monomer constituting each blob), to a sausage-like 
collapsed structure. The driving force in this picture (see Fig.S4A) is the minimization of the 
surface tension in a poor solvent (T < Tq). 

To describe the second stage in the force quench dynamics (or more generally for any biopoly- 
mer that undergoes a stretch collapse transition) using the generalization of expanding 
sausage model, we write the free energy under tension as 

where the first term is the driving force for collapse [Fq = 'jA where A is the surface area of 



(3) 
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FIG. 4: The expanding sausage model for force quench dynamics. A. The free energy profile as a 
function of R for increasing tension. The free energy profile F{R) is plotted in dimensionless form 
using F{R/ L) = |^ = 2 (^)^'^^ + fp^ji ~ (t)' '^^^ stable free energy minima occur at 

R = and R = Ro{fQ) < L when fg > fq. The profiles show that R^^ decreases and Rq increases 
with increasing /. In B, C, and D the dynamics of R{t) (left) and the average refolding time (r) = Tp 
(right) are shown for varying fg, fg, and chain flexibility (/p = ksT/lp), respectively. 

a thermal blob). We rewrite Fc ~ 2kBT(7r^lRY^'^ / ^"^ , where the surface tension for a thermal 
blob is 7 ~ ksT/^'^, and fl is the volume of the sausage, assumed to be constant throughout 
the collapse process The second term in Eq. |3| Fy = — /g-R, accounts for the mechanical 
work. The third term in Eq. |3| which enforces chain inextensibility under the applied tension, 
is required to incorporate the sharp increase of mechanical force at i? ~ L observed in P5GA 
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hairpin (see Fig SIB). A simplified version of the sausage model, where inextensibility is not 
taken into account, is presented in the SI. The details of the free energy (Eqjsj) are not expected to 
change the qualitative picture of the collapse kinetics that emerges using the expanding sausage 
model to describe force-quench dynamics, as long as each term is taken into account. The 
effect of the stretch force fs on the folding kinetics is introduced through the initial equilibrium 
extension Rq, which for fs > kBT/lp{= fp) is Roifs) ^ L (l - ^ fp/Afs^ < L. 

We equate the rate of free energy reduction of the system, F{R), with the entropy production 
due to dissipation, TS{R) ^ r] x R x R^ [24j, where we have used the Stokes-Einstein relation 
and a damping force / ~ — ~ —rjRv. The friction coefficient Q ~ rjR where R is the size 
of collapsed polymer in the longitudinal direction, and f (~ R) is the velocity. By equating 
F{R) ^ -TS{R) we get 

V^c= / ■ = V (4) 



where Rq/ L 1 — \plpf^fs for fs ^ fp the forces are scaled as /q, = faTc/^lL'^ for a = S, Q, 
and p, and with = jrjrj^- The ratio R(t)/RQ = T,(t), which is interpreted as the survival 
probability of the unfolded state upon force quench fq, can be determined by a numerically 
integrating Eq. |4] (see the left column of Figs.4B-D). The average folding time (or mean first 
passage time) is TpUfa}) = = /J" d.'t S(t; {/„}), where t* satisfies R{t*) = 0. The 

results of Tp for various combination of {fg, fs, fp) cire given in the right column of Fig. 
4B-D. In agreement with our simulation results (see also fl3[ [T71 [18]). as well as force quench 
experiments on poly-Ub [6J, the sausage model for WLC shows a sharp decrease in R{t) at an 
early stage of the force-quench dynamics, after which a plateau in R of variable length develops, 
before an abrupt collapse to i? = (see Fig. 2). A long time plateau is observed for a stiffer 
biopolymer (small fp), and for larger fq (Figs.4B and 4D). In agreement with the more detailed 
P5GA simulations, Tp increases exponentially at small quench forces {fq < 1), with a slight 
upward curvature for larger fq (compare Fig.4B). The dependence of Tp on the stretch force fs 
is typically small, and saturates when fg ^ 10^ (compare Figs.lC and 4C). 

A word of caution is appropriate. Eqj4| which is derived in the absence of fiuctuating force 
(noise term in the Langevin equation), is valid only for fq < fq where fq is the force beyond 
which a finite free energy barrier for refolding develops (the dotted energy profiles in Fig.4A). 
Above fq = fq, there is a barrier to refolding, SF^, and a vahd solution to Eq. [i] cannot be 
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found for all t > 0. De Gennes' approach [21] is not applicable for rate processes p5] that involve 
crossing a free energy barrier. Physically, the residence time in the FIMI state becomes infinite 
in the absence of noise. The analog of / J was referred to as "spinodal" or "coercive field" in Ref. 
[26j in which the nonequilibrium relaxation of spin systems was studied. Therefore, to account 
for the force quench dynamics when /g > /g , we use the Kramers theory for barrier crossing, 
assuming that R (under tension) is a good reaction coordinate, an approach that is consistent 
with other estimations of the transition time [271 [28] . In so doing, it is assumed that under 
tension (/g 7^ 0) that results in the suppression of internal fluctuations i? is a good reaction 
coordinate. The refolding time Tp scales with the effective height of the free energy barrier as 
Tp ~ exp {6F^) where 6F^ = F{R^^/L) - F{Rq/L). For /g > /g one can show (see SI for 
details) 

^ '1-27' 



Tp ~ exp 




(5) 



For small /g or increasing flexibility (/^ ^ 1), Tp in Eq,5 deviates significantly from Bell's 
expression [rp ~ exp (/g)] [IHl Ull [15]. We stress that Eq,5 is only approximate and is shown 
to emphasize that activated transitions (in contrast to Eq. 4) dominates the approach to the 
ordered state. 

While additional factors such as topological constraints due to self-avoidance |i29j or other 
interactions may alter the dynamic characteristics (for instance the dependence of Tp on fs or 
fo) the generic feature of the emergence of a FIMI state upon force quench, also evidenced in the 
force-quench collapse dynamics of polymer chain under poor solvent conditions (see Discussion), 
is satisfactorily captured by the simple model. 



DISCUSSION 



Implications for experiments: Our finding that the very nature of folding can be altered 
by the depth of quench as well as the rate of quench readily explains the different conclusions 
reached in the refolding of Ub [7J. As shown in Figs.l and 2, the lifetime of the FIMIs, 
leading to the plateau in R{t), arises only if the ratio A = tq/tp^/q, fs) is small. Under the 
experimental conditions in ref [6], A ~ 0.15, whereas in the more recent AFM experiments, 
A ~ 32 [7] (see Table I for details). Consequently, the experiments by Schlierf and Rief [7] are 
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FIG. 5: Polymer collapse in a poor solvent upon force quench. A. The collapse of a WLC in a poor 
solvent. B. Toroidal, single racquet, or multiple racquet structures are formed in various trajectories. 
C. The entropy production using Eq. [2] displaying the low^high— >low pathway from high fs to low 
/q. D. Linear reduction in force over a period, tq, during which the free energy gradually changes 
from a profile with / > /m to one with f < fm- Depending on the time scale for hopping between the 
native and unfolded basins of attraction, Tf^op, in comparison to tq, the pattern of relaxation dynamics 
upon force quench can be greatly affected. 

closer to equilibrium conditions than the force quench refolding reported in [6j. We predict 
that the apparent differences between the conclusions reached in [6] and [7J are solely due 
to differing values of A. The generality of these findings can be further appreciated from 
the study of Li et. al. on TAR RNA (A = 0.1, see Table I) that showed evidence for the 
development of a plateau (see Fig.3A in ref. [8]), although its importance was not discussed. 
Our findings also imply that only by varying A over a wide range it is possible to discern 
the multiple stages in folding, and firmly establish a link between collapse and folding transitions. 
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FIMIs are a generic feature of force-quench dynamics : The sharp transitions in 
R{t), from its value in the second stage to the native compact structure in the third stage is 
reminiscent of a weak first order phase transition. Analysis of Ptse{R) for P5GA (Fig.2A) show 
that the formation of the hairpin loop from a FIMI is the key nucleation event in bringing the 
stretched RNA into its native-like hairpin structure. Because the simulations on RNA hairpins 
and the expanding sausage model suggest the long plateau only depends on the characteristic 
time scales associated with the folding polymer and the experimentally controlled variables in 
force quench, a similar behavior should be observed even in polymers that do not have a unique 
native structure, but can undergo a transition to a globular state. To this end, we consider 
the dynamics of R{t) in a homopolymers (flexible and worm-like chains) in a poor solvent upon 
force quench. The simulation details are in the SI. 

Collapse of Semiflexible chain : In the absence of force, the thermodynamically stable state 
of a semiflexible chain in poor solvent is a toroid (see Fig.SA). We consider the collapse of a stiff 
wormlike chain (bending rigidity = 80kBT, see Eq.(18) in SI) that is initially equilibrated at 
/ = /5 = 83 pN, and the force is suddenly quenched to /g = 4 pN. From the time traces of R(t) 
in Fig.SA, we surmise that the transition to the toroidal structure occurs in three stages, just 
as for the P5GA hairpin and Ub [6]. Within our simulation times (< 170 ^s), some molecules 
adopt toroidal (magenta) or racquet structures (green), while others (brown) fluctuate around 
large values of R without adopting any ordered structure. These flndings for the formation 
of toroids for the WLC are in quantitative accord with experiments on the dynamics of force 
quench collapse of A— DNA, which showed that toroid formation occurs in a stepwise, quantized 
manner [SU], and is in agreement with the forced- unfolding of lattice model of proteins [31]. 

The entropy production, S{t), of a collapsing WLC is shown in Fig.SB only for those molecules 
that form an ordered structure (the trajectories grouped in red curly bracket in Fig.SA (or see 
Fig.SS for complete set of trajectories)). The low entropy of the initial, highly stretched state 
increases at flrst, until it attains a maximum value, corresponding to an ensemble dominated by 
the FIMIs, after which the entropy slowly decays to a value in the more ordered, low entropy 
state. This flnding is in agreement with the three-stage collapse dynamics of the simpler RNA 
hairpin with a well-deflned folded structure. 

Collapse of flexible polymers in a poor solvent : To further demonstrate that bending rigid- 
ity in the semiflexible chain plays no special role in leading to FIMIs we have considered the 
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stretch— ^globule transition of a flexible hompolymer in a poor solvent under force quench con- 
ditions (see SI text for details). The final globular structure, in the absence of force, forms 
continuously from the ends of the chain (a pearl-necklace ensemble), with no evidence of a 
plateau in the collapse (see Fig.S6A). However, the continuous collapse transition becomes co- 
operative and weakly first order by choosing a large quench force /g = 75 pN (Fig.S6B), due to 
the formation of FIMIs. 

These two examples for homopolymers and the results for RNA hairpin show that the value 
of A, and not the specific architecture of the folded structure, dictate the collapse in the inter- 
mediate stages. The expected changes in R{t) of a biomolecule (or a homopolymer in a poor 
solvent) upon a rapid fs fq quench (small A) must occur in multiple stages (Fig.SS) with 
stages 1 and 2 being determined by entropy growth (stage 1). The search for the folding nuclei 
[32] that can further drives structure formation (stage 3) and sequence-dependent effects arise 
in the final stages of folding (Fig.SC). 

If the force is quenched rapidly, the biomolecule will be far from equilibrium when the first 
nucleation event occurs, which will drastically change the dynamics of the folding process. 
Folding occurs in a near equilibrium manner either if the force is relaxed slowly (Fig. 5D), 
so that the internal modes of the biomolecule equilibrate or if the quench force is sufficiently 
large that there is a significant barrier to folding. For a slow decrease in the force, the relevant 
timescale are tq = AL/vl, whereas the relevant timescale for the barrier crossing event will be 
'^hopWo) (Fig- 5D). Thus, near folding occurs at near equilibrium for A = Tqlrhop ^ 1, while 
non-equilibrium folding occurs in the opposite limit. Variations in the experimental protocol 
have to be taken into account in elucidating the pathways explored during the folding process. 

METHODS 

The P5GA hairpin, as represented using the SOP model (see ref |22j for the energy function 
and simulation method), is a two state folder with a transition mid-force of fm = 14.7 pN [10]. 
We probe the refolding kinetics for multiple pairs of initial and quench forces, {fsj'g), whose 
magnitudes range from 14 < /5 < 70 pN and 2 < /q < 12 pN (see Fig.lA). The force-quench 
refolding kinetics for 100 molecules were simulated beginning with an initial tension (fs) and 
quenched to a final tension (/q). The refolding time of the z*'^molecule, Tpii), was defined as 
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TABLE I: Rate of force-quench in various experiments 



Experiments 


vl (slew rate) 


tq (quench time) 


Tpifo) (folding time) 


"A = tq/tfUq) 


Poly-ubiquitin [6] 


~ 10^ nm/s 


3 msec 


0.2 s (/q = 15 pN) 


0.015 


TAR RNA [8] 


> 200 nm/s 


0.1 sec 


~ 1 sec 


0.1 


RNAase H (/ ^ iV) [33] 


(10 - 10^) nm/s 


(0.05 - 5) sec 


5 sec {I U) 


(0.01 - 1) 


RNAase H (TV ^ [/) [33] 


(10 - 10^) nm/s 


(0.05 - 5) sec 


3000 sec {N U) 


(1/60000 - 1/600) 


Ubiquitin dimer [Tj 


5 nm/s 


~0.16 sec 


>5 msec [fq = pN) 


<32 


P5GA hairpin (simulation) 


~ 10^^ nm/s 


47 ps 


(10 - 1000) //s 


(10-9-10-6) 


WLC (simulation) 


~ 10^^ nm/s 


0.5 ps 


> 100/is 


< 5 X 10-9 



"^For A <C 1 we expect non-equilibrium effect which leads to population of long-lived FIMIs. In the opposite 
hmit folding occurs close to equilibrium conditions. The values of A for Ub explain the differences in the refolding 
found in references [6 , and .7, . The value of A in all our simulations are extremely small and cannot be realized in 
AFM experiments. However, we expect qualitative changes in R{t) only when A > 1, and hence our simulation 
results are qualitatively similar to the small A(^ 1) used in 

the first time the hairpin gained more than 95% of native contacts after the tension was reduced 
to / = /q. 



SUPPORTING INFORMATION 



T-quench versus /-quench dynamics : Our work also provides insights into the quahta- 
tive differences between the two-state dynamics upon force or temperature variations (compare 
Ptde{R) and Ps{R) in Fig.2). For an apparent two-state folder, the fraction of folded states is 
VnbaU) = l/[H-e"(^^5F-/^^'yF)/fcsT]_ The half-width of fluctuations (5/1/2) around transition 
mid-force fm = AFuf/^xuf using cL'^nbaU) / df is 

AxuF ^ 6 nm for P5GA hairpin [22] (see Fig.S3), thus 5/i/2 ^ 1.2 pN at fc^T = 4.14 pN ■ nm. 
For the folded state to be predominantly populated after the force is quenched, the depth 



18 



of quench should satisfy SJq = fm ~ fq > <5/i/2 ~ 1-2 pN. It is noteworthy that only the 
characteristic force ksT / ^xuf determines the sharpness of the transition (5/1/2), while AFup 
plays no significant role. Since Axuf ~ N, it follows that 5/i/2 ~ 1/^! which differs from the 
thermodynamic scaling upon temperature variation, {Tq — T)/Tq = 6T/Tq ~ 1/\/N [2H 
Thus, for a given length A^, the force-induced transition is sharper than the temperature-induced 
transition [IB]. Since exp (/qAx^^^/^bT) = exp {f„^Axlj_^j^/kBT) exp {-SfgAxfj^^/kBT), a 
larger depth of quench exponentially increase the rate of folding by lowering the free energy 
barrier associated with refolding dynamics from the UBA, in contrast to the homopolymer 
following a temperature quench (tc ~ tr6T/Tq) [24j . 

Calculation of Prse ^nd Ptse- To calculate Prse{R) over the time traces, we collect 
the data points from 50 < t < 70 yus for fs = 56 pN, and 10 < t < 30 /is for fs = 14 
pN. Similarily, to calculate Ptse{R) over the time traces, the data are collected from 
Tpii) — 270 < t < Tpii) — 250 /is for all fq values, where TF{i) denotes the folding time of i^^ 
trajectory. 

Dependence of folding time of P5GA hairpins on stretch- {fs) and quench-forces 

(/q) : For a given survival probability = R{t] fs, /q)/-R(0; fs)) the average folding time 
is obtained by using 

poo 

rFifs,fQ)= / dtEit-JsjQ). (7) 
Jo 

The force fs determines the initial distribution of molecules in the UBA and NBA, and is giving 
the Boltzmann distribution. To probe folding we are interested in the dynamics of molecules 
that are initially in the UBA. The fraction of molecules in the UBA for the two state RNA 
hairpin is 

^UBAifs) = 1 ^ ^iAFujfsA.uN)/kBT 
1 + Q{-5fsAxuN)/kBT 

where fs = fm + {fs - fm) = fm + Sfs, and AFun - fmAxuN = 0. It follows from Fig. IB 
in the text that Tf ~ c-^'^'^^u^n/^bT _ gj^^^g g^^^ g^j^g independent control variables in an 
experiment, one can factorize the survival probability as T,(t, fs, fg) = ipuBA(yfs)^(t]'^,fQ)- 
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Therefore, 



TF{Sfs,SfQ) 



r(0) 



1 + Q{-SfsAxuN)/kBT 



(9) 



where 5fs = fs - fm and 



fm 



The expanding sausage model for a Gaussian chain under tension: In the presence 
of an external tension /, the simplest modification to the free energy of the expanding sausage 
model as a function of the length of the sausage (~ R) is given by 



F = -fA-fR 



X 27rpR - fR 



2VnnR^^^ - fR, 



(10) 



where the surface tension for the thermal blob (of size ^) is given as kBT/^'^, and the exposed 
surface area of the cylindrical sausage is A ~ 2n pR (p is the radius of the sausage). The model 
is schematically shown in Fig. S4A. The rate of free energy reduction in the system is given by 

'kBT 



F = -fA- fR 



while the entropy production due to the dissipation is 
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-TS ^r]x Rx R^. 



(12) 



Equating F = —TS, as in the main text, we obtain 

/ 



t/ro 



f\y^ 



if 



2 , 

077 



1-/ 



Ro 



\ 



4 



k=l 



R^ 

Rn 



k/2 



(13) 



where Rq the initial extension of the polymer, tq 



and /° = fro/vRl The ratio R/Ro 
in Eq 13 is interpreted as the survival probability Psi^)^ as in the main text. For /° 0, we 
recover the de Gennes' collapse kinetics in poor solvent without tension, Psit) = 1 ~ | ( ) 
The average folding (collapse) time in the absence of tension, given in Eq. 6 of the main text, 
is (r)o = Jq^^^^° dtSo{t) = |ro. Using the definition of tq and /°, we can rewrite the free energy 
F in the dimensionless form, 



F{R/Ro 



F{R)to 



2{R/Ro 



,1/2 _ 



f (R/Ro 



(14) 
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which is shown in Fig. S4A. 

;;0 



Above / 



4n 



1, the free energy develops a barrier (6F ) for refolding, whose position varies 

cannot be found 
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with force as R^^ /Rq = l/(/ )^. If R^^ < Rq, a vahd solution of SI Eq. 
for all t > (see also the discussion in the main text). The refolding time Tp scales with the 
effective height of a free energy barrier as Tp ~ exp (SF^) where 6F^ is easily calculated as 
SF^ = F^R^^^ / Rq) — F{1) = 1//° — 2 + /°. Therefore, the refolding (or nucleation) time scales 



with the quench force (/ ) as 



Tp ~ exp 



7^ 



2 + / 



(15) 



in parallel to the calculation in the main text. The survival probability for varying / is shown 
in Fig. S4B. The average folding time is determined from Eq. 6, and we see in Fig. S4C that 
Tp increases almost exponentially with the quench force for < 0.6, beyond which there is a 
sharp increase in log[Tp{f^)]. However, we note that the rapid reduction in R in the first stage 
of collapse seen in the simulations (Fig. ID of the main text) is not observed in the predicted 
behavior of i?(t)/i?o for the Gaussian chain (Fig. S4B). 



Barrier height in the WLC-expanding sausage model for fg > (/q)* : When 
7q > (7q)*. Ro/L 1 and R^^L 0. The conditions of f\Ro/L) = and f\r^^ / L) = 



F (Ro/L) = 



and 



F'{R^^/L) = 



lead to 



and 



Rq 

T 



-1/2 



1 



R 



4(l-i?o/L)2 ^« 

TS\ -1/2 



4(1 - i?o/iv)2 4 L 
7c 



fc 



L 



1 i?^^ 

+ 



_A{1-RTS/Ly 4: L _ 



R' 



TS\ -1/2 



fc 




fc 



(16) 



(17) 



:i8) 



(19) 
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The free energy barrier for /q(> (/q)*) can be computed using 



1 1 [3-2/f 



+ 



Q 



I ll„ ( 1 1 
JQ "^fgK fq fq 



(20) 



and 



F{Ro/L) ^2{l-^s 



3/2 



4 \2y/^ 



-6^^ + 85 -/q(1- 25) 



^ + (2 - /q) - (2 - /q + ^/ J + 2/,. + 0{s'/'^ 



(21) 



where s(= fp/fg) when /q takes large value. Thus, the free energy barrier is calculated 
as 



4 



SF' = F{R^^/L) - F{Ro/L) 



27; 



2 + /c 



fq + 




+ 



4 



3/2- 



(22) 



The collapse of a semiflexible chain in a poor solvent : The Hamiltonian used in 
our simulations of a wormlike chain in a poor solvent undergoing collapse to a toroidal structure 
upon force quench is given by 



k k "^"^ 



a 
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a 



■f{zN-z,)] (23) 



1=1 i=l i,j 

with the parameters, slj = I.S/cbT, A;^ = 2000A;Br, AT = 200, a = 0.6 and k^ = 80 A;Br. To 
integrate the equation of motion, we used Brownian dynamics algorithm. 



f{t + At) = f{t) - D^n{{f})At/kBT + R{t). 



(24) 



where R(t) is a vector of Gaussian random number satisfying (R) = and 
{Ra{t) ■ R/sit)) — 2DAt5ap- We use diffusion coefficient D = 1 x 10~^cm^/s, thus characteristic 
time for simulation is a'^/2D 2 ns. We chose integration time step At — 0.00025(a^/2D) 0.5 
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ps. 
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FIG. SI: Initial ensemble for the P5GA simulation. A Structural ensembles of the P5GA hairpin 
represented by P{R) at each fs- B The average end-to-end distance as a function fs- Excluding the 
values at fs = 14 and 16 pN, where the distribution is biomodal, (i?) vs fs is well fit by the wormlike 
chain model {R) ^ L (l - J ^fj^) , yielding L = 12.2 nm and Ip = 0.43 nm. 
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FIG. S2: Analysis of Tp versus fq data by adapting Dudko and coworkers' microscopic model for force 
spectroscopy tfUqJs) = r°{fs) 1 + "^^^f^-^ 1 e 



-AFt_^^/fcsr.[l-(l+i./QAxt,^^/AFt_^)i/-] 



where we use v = 2/3 (cubic potential) and change the sign of force from the one in Ref.|16| to 
consider the refolding process under tension. When Tp is scaled by Tp for each fs, the folding times 
approximately collapse to a single curve (right). From the fit using thick line, the parameters extracted 
for the collapsed data are Ax^ = 0.45 nm and AfI^p = 0.56 pN • nm. 
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R (nm) 



FIG. S3: The free energy of the P5GA hairpin as a function of R near / = /m, schematically illustrating 
how the initially stretched structure of RNA at fs = 18 pN adapts its structure under the force-quench 
condition at fg = 12 pN. 
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FIG. S4: The expanding sausage model under variation of the external tension. A. The free energy 
profile calculated using R/Rq with varying tension (/g). The collapse of a homopolymer is diagrammed 
schematically for fq = 0, but the extended structure can be stabilized in the presence of tension. B. 
Reduction of the molecular extension with increasing tension (from left to right). C. The average 
refolding time as a function of external tension, with log(Ti?) ~ for < 0.6. 
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FIG. S5: Full 50 force-quench induced dynamic trajectories of semiflexible polymers. 
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FIG. S6: The collapse dynamics of a freely jointed homopolymer in a poor solvent is shown, with 
fs = 83 pN to /q = 4 pN (A) and fs = 83 pN to fg = 75 pN (B). The flexible chain shows no 
evidence of the plateau for small fg, but a higher quench force stabilizes intermediate structures. 
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